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5-h Abstract 

^* We consider the problem of online dynamic power management that provides hard real- 

time guarantees. In this problem, each of the given jobs is associated with an arrival time, 
a deadline, and an execution time, and the objective is to decide a schedule of the jobs 

i_i as well as a sequence of state transitions on the processors so as to minimize the total 

energy consumption. In this paper, we examine the problem complexity and provide online 

f^\ strategies to achieve energy-efficiency. First, we show that the competitive factor of any 

online algorithm for this problem is at least 2.06. Then we present an online algorithm 
which gives a 4-competitive schedule. When the execution times of the jobs are unit, we 
show that the competitive factor improves to 3.59. At the end, the algorithm is generalized 

p^ to allow a trade-off between the number of processors we use and the energy-efficiency of 

£> the resulting schedule. 

o 

^ 1 Introduction 

Reducing power consumption and improving energy efficiency has become an important design- 
ing requirement in computing systems. For mobile devices, efficient energy management can 
effectively extend the standby period and prolong battery lifetime. For large-scale computing 
clusters, appropriate power-down mechanism for idling processing units can considerably reduce 
the electricity bill. 

In order to increase the energy efficiency, two different mechanisms have been introduced to 
reduce the energy consumed for idling periods. (1) Power-down Mechanism: When a processor 
is idling, it can be put into a low-power state, e.g., sleep or power-off. While the processor 
consumes less energy in these states, a fixed amount of energy is required to switch the system 
back to work. In the literature, the problem of deciding the sequence of state transitions 
is referred to as dynamic power management. (2) Dynamic Speed Scaling: The concept of 
dynamic speed scaling refers to the flexibility provided by a processor to adjust its processing 
speed dynamically. The rate of energy consumption is typically described by a convex function 
of the processing speed. This feature is also referred to as dynamic voltage frequency scaling, 
following its practical implementation scheme. 

For systems that support the power-down mechanism, Baptiste |3l| proposed the first polynomial- 
time algorithm to decide the optimal strategy for turning on and powering off the system for 
aperiodic real-time jobs with unit execution time. In a follow-up paper, Baptiste et al. [4] further 
extended the result to jobs of arbitrary execution time and reduced the time complexity. When 



X 



strict real-time guarantees are not required, i.e., deadline misses of jobs are allowed, Augustine 
et al. p] considered systems with multiple low-power states and provided online algorithms. A 
simplified version of this problem is also known as ski-rental [17| . 

Dynamic speed scaling was introduced to allow computing systems to reach a balance be- 
tween high performance and low energy consumption dynamically. Hence, scheduling algorithms 



that assume dynamic speed scaling, e.g., Yao et al. 19 , usually execute jobs as slowly as pos- 
sible while ensuring that timing constraints are met. When the energy required to keep the 
processor active is not negligible, however, executing jobs too slow may result in more energy 
consumption. For most realistic power-consumption functions, there exist a critical speed, which 
is most energy-efficient for job execution [6 14 



Irani et al. 14 initiated the study of combining both mechanisms. For offline energy- 
minimization, they presented a 2-approximation. For the online version, they introduced a 
greedy procrastinating principle, which enables any online algorithm for speed scaling without 
power-down mechanism to additionally support the power-down mechanism. The idea behind 
this principle is to postpone job execution as much as possible in order to bundle workload for 
batch execution. The usage of job procrastination with dynamic speed scaling for period tasks 



has later been explored extensively in a series of research pip, 18 . 

The combination of the power-down mechanism with dynamic speed scaling suggests the 
philosophy of racing-to-idle: Execute jobs at higher speeds and gain longer quality sleeping 
intervals. In a recent result, Albers and Antoniadis [l] show that the problem of minimiz- 
ing the energy consumption for speed scaling with a sleep state is NP-hard and provide a 
3 -approximation. 

Our Focus and Contribution. In this paper, we examine the problem of online dynamic 
power management to minimize the energy consumption. We present both lower bounds on the 
problem complexity and algorithmic results. 

First, we show that the competitive factor of any online algorithm for this problem is at 
least 2.06. This shows that this problem is already harder than the ski-rental problem, which 
has a tight competitive factor of 2 that is inherited by a couple of online scheduling problems 



as the only known lower bound 15, 17 



Second, we present a 4-competitive online algorithm that uses at most two processors for 
any given set of jobs known in advance to be packable on a processor. When the execution 
times of the jobs are unit, we show that the competitive factor improves to 3.59. Then, we 
generalize our algorithm for a prescribed collection of job streams to allow a trade-off between 
the number of processors we use and the energy-efficiency of the resulting schedule. 

Note that, our assumption on the input job set is crucial in the sense that packing the jobs 
using a given number of processors is known to be a long-standing difficult problem even for 
the offline case [8 - 10 , 12 , and for the online version only very special cases were studied 10 , 16 



Due to the space limit, some technical details and proofs are provided in the appendix for 
further reference. 

2 Notations and Problem Definition 

In this section, we provide definitions to the scheduling model assumed in this paper, followed 
by a formal problem definition. 

The Jobs. Each job j is associated with three parameters, namely, the arrival time aj, the 
execution time Cj, and the deadline dj. The arrival time of a job is the moment it arrives to the 
system and becomes ready for execution. The execution time is the amount of time it requires 
to finish its task, and the deadline is the latest moment at which the task must be completed. 
We assume that Cj and dj are both known at the moment when j arrives to the system. 



For notational brevity, for any job j, we use a triple j = (aj, dj, Cj) to denote the correspond- 
ing parameters of j. We call a job j a unit job if Cj = 1 and we write j = (aj,dj). Moreover, a 
job j is said to be urgent if Cj = dj — aj. 

Energy Consumption. In this paper, processors for executing jobs are assumed to have 
three states, namely busy, standby, and off. When a processor is off, it cannot execute jobs and 
consumes a negligible amount of energy. Switching a processor from off to other states requires 
E w units of energy. A processor is in busy state when executing a job. The amount of energy 
it consumes per unit of time when busy is denoted by ipb- When a processor is in standby, it 
consumes ip a amount of energy per unit of time. We assume tp a < ?/V For convenience, we use 
the terminology "turning on" and "turning off" to denote the transition between the off state 
and other states. 

Provided the above notion, the break-even time, denoted by B, is defined as E w /i[) a . Literally, 
this corresponds to the amount of time a processor has to stay in standby in order to have the 
same energy consumption as a turn-on operation. Break-even time is an important concept 
that has been widely used for ski-rental-related problems |17| and dynamic power management 



algorithms in the literature, e.g., 13-15 . 



Job Scheduling. Let J be a set of jobs. A schedule S for Jona set of processors Ai is to 
decide for each processor m £ A\: (1) a set I m of time intervals during which processor m is 
turned on, and (2) a function job m (t) : M + — > J of time indicating the job to occupy processor 
m at time t. The schedule S is said to be feasible if for each job j G J , there exist a processor 
m £ Ai such that 

Yl / *(Job m (t),j)-dt > cj, 

where 8(x, y) = 1 if x = y and 5(x, y) = otherwise. The energy consumption of the schedule 
S, denoted E(S), is hence 

E(S) = J2 ( E ™ • l X -l + / ( E b ~ ^(Job m (t), 0) • (E b - E s ) ) ■ dt\ . 

The goal of the Power-Minimizing Scheduling Problem is to find a feasible schedule S such that 
E(S) is minimized. 

In this paper, we consider the case where the jobs are arriving to the system dynamically in 
an online setting, i.e., at any time t, we can only see the jobs whose arrival times are less than 
or equal to t, and the scheduling decisions have to be made without prior knowledge on future 
job arrivals. To be more precise, let J be the input job set and J(t) = {j : j € J , aj < t} be 
the subset of J that consists of the jobs whose arrival times are no greater than t. 

Definition 1 (Online Power-Minimizing Scheduling). For any given set J of jobs, the online 
•power-minimizing scheduling problem is to compute a feasible schedule such that the energy 
consumed up to time t is small with respect to OPT(J(t)), where OPT(J(t)) is the energy 
consumed by an optimal schedule of J{t), for any t > 0. 

The Schedulability of the Jobs. Chetto et al. [7] studied the schedulability of any given 
set of jobs and proved the following lemma. 

Lemma 1 (Chetto et al. [7]). For any set J of jobs, J can be scheduled on one processor using 
the earliest- deadline- first (EDF) principle, which always selects the job with earliest deadline 
for execution at any moment, if and only if the following condition holds: 

For any time interval (I, r), we have >, Cj < r — £. (1) 

j : jdj ' ,t<aj ,dj <r 



Note that, it is well-known that, for any set J of jobs, if there exists a feasible schedule 
for J that uses only one processor, then the EDF principle is guaranteed to produce a feasible 
schedule 111]. Therefore, Condition (Fy) gives a necessary and sufficient condition for any set of 
jobs to be able to be packable on a processor. 

3 Problem Lower Bound 

In this section, we prove a lower bound of 2.06 on the competitive factor of any online algorithm 
by designing an online adversary A that observes the behavior of the scheduling algorithm to 
determine the forthcoming job sequence. 

Let II be an online scheduling algorithm for this problem. We set ipb = ip a = ip = 1 and 
E w = k, where k is an integer chosen to be sufficiently large. Hence the break-even time B is 
also k. Without loss of generality, we assume a length of the minimum tick of the system to 
be eo, which we further assume to be 1. We define a monitor operation of the adversary A as 
follows. 

Definition 2. When A monitors II during time interval [io,£i], it checks if II keeps at least 
one processor on between time to and t\. If S turns off all the processors at some point t between 
to and ti, then A releases an urgent job of length eo immediately at time t + eo, forcing II to 
turn on at least one processor to process it. If II keeps at least one processor on during the 
monitored period, then A does nothing. 

Let x, rj, and x, where < x < g, be three non-negative parameters to be chosen carefully. 
The online adversary works as follows. At the beginning, say, at time 0, A releases a unit job 
(0, B, eo) and observes the behavior of II. Let t be the moment at which II schedules this job 
to execute. Since II produces a feasible schedule, we know that < t < B — 1. We have the 
following two cases. 

Case(l): If < t < (5 — x)B, then A monitors II from time t to \B. 

Case(2): If j is not executed till (| — x^)B, A releases (^ + x)B — eo unit jobs with absolute 
deadline B at time {^ — x)B + €q. As a result, the online algorithm is forced to wake up at 
least two processors to meet the deadlines of the jobs. The adversary continues to monitor LT 
until time (| + ri)B. If no urgent unit jobs have been released till time (| + r))B, A terminates. 
Otherwise, it monitors LT for another yB> units of time until (| + r\ + x)B. 

Let £ (LT) and £{0) denote the energy consumed by algorithm LT and an offline optimal 
schedule on the input sequence generated by A, respectively. The following lemmas give a lower 
bound on the ratio £ (LT)/£(0) for each of the aforementioned cases, expressed as a function of 

k, x, rj, and x- 

Lemma 2. If the first job released by A is scheduled at time t with 0<£< (^ — x^)B, then we 

have £(U) 1 /1\ 2 

-)-( > 2 + -x - O - , for any < x < -. 

£{o) - 2 V fc y ~ ~ 5 

Lemma 3. If the first job released by the adversary is not executed until time (^ — x)B, then 
for any x, ry, x > we have 

£(U) . j 3 + x + T] 4 + x + ?? + x 5 + x + r] + x \ Q /l 



£(0)'-""'\ l+x 2 + x + i] '2 + x + ri + xj " \k 

By combining Lemma [2] and Lemma [3] we obtain the following theorem. 

Theorem 4. The competitive factor of any online algorithm for the online power-minimizing 
scheduling problem is at least 2.06. 



4 Online Scheduling 

We have seen in the last section that, in order to bundle the execution of the jobs while providing 
strict real-time guarantees, additional processors are necessary compared to those required by 
an optimal offline schedule. 

In this section, we first assume that Condition (fTl) from Lemma FT] holds for the input set of 
jobs and present an online strategy that gives an energy-efficient schedule using at most two 



processors. In § 4.4 we generalize our algorithm for a prescribed collection of job streams, each 
of which delivers a set of jobs satisfying Condition (IT]), to allow a trade-off between the number 
of processors we use and the energy-efficiency of the resulting schedule. 

We begin our discussion with a review on the commonly used approaches and their drawbacks 
in our problem model. 

Common Approaches and Bad Examples. A commonly used approach to bundling the 
workload is to delay the execution of the jobs as long as possible until no more space for delaying 
the job execution is left, followed by using the earliest-deadline- first principle to schedule the 
jobs, e.g., the algorithm due to Irani et al. [14] . When there is no more job to execute, the 
ski-rental problem and related scheduling problems [IT] suggest that we stay in standby for B 
amount of time before turning off the processor. Let C denote this approach. 

Lemma 5. The competitive factor of C can be arbitrarily large. Furthermore, even when Cj = 1 
for all j £ J, the competitive factor of C is still at least 6. 

The above lemma shows that, when the jobs can have arbitrary execution times, packing of 
the jobs should be done more carefully. Furthermore, even when we have Cj = 1 for all j £ J , 
there is still room for improvement on the competitive factor. 

Our Main Idea. The examples provided in Lemma[5]give a rough idea on the drawbacks of C, 
which are twofold: (1) Scheduling jobs on different processors using a global priority queue can 
easily result in deadline misses. (2) Blindly delaying the jobs can lead to a less energy-efficient 
schedule. 

The former one is more straightforward to deal with. By suitably partitioning the job set, 
the feasibility can be assured by our assumption on Condition (fTl). For the latter problem, 
we introduce the concept of energy- efficient anchors for the jobs in order to determine the 
appropriate timing to begin their execution. 

4.1 Our Algorithm 

We define some notations to help present our online algorithm S and the analysis that follows. 
Let J be the input set of jobs, and recall that J{t) is the subset of jobs whose arrival times are 
smaller than or equal to t. 

For any t, P with < t < V, let Q(t) be the subset of J{t) that contains the jobs that have 
not yet finished their execution, and let Q(t,tf) be the subset of Q(t) containing those jobs 
whose deadlines are smaller than or equal to t'. Note that, by definition, we have Q(i, t') C 
Q(i) ^ <y(t) — 3 ■ For notational brevity, let c'At) denote the remaining execution time of 
job j at time t, and let W(t) = X^'eQ(t) c'-(i) an d W(t,v) = X^'eort tt) c 'j(t) denote the total 
remaining execution time of the jobs in Q(t) and Q(t, v), respectively. Furthermore, we divide 
Q(t) into two subsets according to the arrival times of the jobs. For any t,t* with < t* < t, 
let Qp roc (t) be the subset of Q(t) containing the jobs whose arrival times are less than t* , and 
letQ i / or4/l (t) = Q(t)\Q^; oc (t). 



Let A, < A < L, be a constant to be determined later. For each job j £ J , we define a 
parameter hj to be max {a,j, dj — XB}. The value hj is referred to as the energy-efficient anchor 
for job j. 

Let Mi and M2 denote the two processors which our algorithm S will manage. We say that 
the system is running, if at least one processor is executing a job. The system is said to be 
off if all processors are turned off. Otherwise, the system is said to be in standby. During the 
process of job scheduling, our algorithm S maintains an urgency flag, which is initialized to be 
false. At any time t, S proceeds as follows. 

(A) Conditions for switching on the processors: 

(a) If the system is off and there exists some j G Q(i) such that hj > t, then turn on 
processor M\. 

(b) If the urgency flag is false and there exists some P with P > t such that W(t, P) > 
P — t, then 

• turn on M\ if it is off, 

• turn on Mi, set t* to be t, and set the urgency flag to be true. 

(B) To handle the job scheduling: 

(a) If the urgency flag is true, then use the earliest-deadline-first principle to schedule 
jobs from Qp roc (t) on Mi and jobs from Q t t ort i l {'t) on Mi. 

(b) If the urgency flag is false and the system is not off, then use the EDF principle to 
schedule jobs from Q(i) on the processor that is on. 

(C) Conditions for turning off the processors: 

(a) If the urgency flag is true and C^ proc {t) becomes empty, then turn off M\ and set the 
urgency flag to be false. 

(b) If the urgency flag is false, the system is standby, and t — t\ > B, where £1 is the time 
processor Mi was turned on, then turn off all processors. 

Note that, Mi and Mi can both be on only when the urgency flag is true. 

4.2 The Analysis 

Let h = (h,ri),h = (4,^), • • • ,I K = {&K,r K ), where n < tj for 1 < i < j < k, be the 
set of time intervals during which the system is either running or in standby. We also refer 
these intervals to as the awaken-intervals. For ease of presentation, let Iq = (0, 0) be a dummy 
awaken-interval. 

Lemma 6. Provided that Condition (fTl) from Lemma [I] holds for the input jobs, algorithm S 
always produces a feasible schedule. 

Below, we bound the competitive factor of S. Let V% be the number of times S turns on a 
processor in /j. We have the following lemma regarding Vi- 

Lemma 7. For each i, L < i < k, we have 

• Vi<2. 

• If Vi = 2, then the amount of workload that arrives after r,_i and has to be done before 
ri is at least XB. 
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Figure 1: The relative positions between sleep intervals of O and awaken-intervals of S. 

Let Jj denote the set of the jobs executed in awaken-interval h in S. As a consequence to 
Lemmaul we have X^eJ c i — ^ ^ or eacn Ii with V{ = 2, 1 < i < k. Let be an optimal offline 
uni-processor schedule. Below, we relate our online schedule to the optimal offline schedule O. 
Suppose that U{ = (£i,opt,i'i,opt)j I < i < m, is the set of intervals for which O keeps the system 
off. We also refer these intervals to as the sleep intervals of O. 

Lemma 8. For any i, 1 < i < k, if both rj_i and li are contained in some sleep interval 
U u = (£ u ,opt, r u ,opt) of O, then V% = \ and n - r Uy0pt > (1 - X)B. 

Lemma 9. Each sleep interval of O intersects with at most two awaken-intervals of S. In 
particular, each of U\ and U m intersects with at most one. 

The above two lemmas characterize the relative positions between the sleep intervals of O 
and the awaken intervals they intersect with. See also Fig. [T] for an illustration. For simplicity, 
we implicitly concatenate U\ and U m , i.e., we assume that the left end of U\ is connected to 
the right end of U m for the rest of this section. 

Below we upper-bound the energy consumption of S by the energy consumption of O. In 
order to help present the analysis, we define the following notations for the energy consumption 
of the system in S and in C: (a) £ Wtff : energy for waking up and standby, (b) £ w : energy 
for waking up, (c) St,: energy for executing jobs, and (d) Eh, a'- energy for executing jobs and 
standby. 

We use E W)(T (S,ti,t2), and £&(<S,£i,i2) to denote the corresponding energy S consumes be- 
tween time t\ and £2- Furthermore, Eh, a {0,ti,t2), and £ w (0,ti,t2) are the corresponding ener- 
gies O consumes between time t\ and £2- 

Lemma 10. For each awaken-interval Ii, 1 < i < k and A < 1, 

£ w , ff (S,li,ri) < ^ Ij^ c o ) + ( 3 - X ) E ™- ( 2 ) 

Furthermore, if V% = 1, then £ w<a (S,£i,ri) < 2E W . (3) 



By combining all the above analysis in Lemmas [7J |8j [9j and 10 for the properties between the 
optimal offline schedule and the schedule derived from S, we prove an upper bound on the 
energy consumption £ W;(T (S,0, 00) in the following lemma. 

Lemma 11. When 2 — v3 < A < 1, we have 

£^(5,0,00) < (4- A) • £ b , a (O,0,oo) + 4- £ w (0,O,oo). 
Proof. Consider each of the following three exclusive cases of an awaken-interval Ii , for 1 < i < 

K. 

Case 1: Interval Ii does not intersect with any sleep interval of O. Therefore, in the optimal 
schedule O, the processor is not off during time interval (£i, n). By our algorithm design, the 



length of an awaken-interval is at least B. Hence £b,cr(0,li,ri) > B ■ i/) a = E w . By Lemma 10 
when A < 1, we have 

£ w A s ^i^i) < i> a ■ Yl c j + ( 3 - A ) • EbAO'ti,*). (4) 



Case 2: Interval Ii intersects with a sleep interval U u = (£ u ,opt, r u,opt) and T % is contained 
in interval U u , We have two subcases: (a) If V% = 1, by Eq. (I3J) in Lemma 10 we have 



Sw,a(S,£i,ri) < 2E W = 2£ w (0,£ U:0 pt,r Ut0p t), in which the equality comes by the definition 
of a sleep interval, (b) If V% = 2, to provide an upper bound to Eq. ^ in Lemma 10 , we use the 



energy consumption £ w (0 , £ u>op t, r U)0pt ) in the optimal schedule to bound 2E W , and the energy 
consumption £^^(0 , r^i, £ Uj0pt ) in the optimal schedule to bound (1 — X)E W . We know that, 
when2- v / 3<A<l,wehave X _ A < A(3 _ A) . (5) 

By Lemma 171 in O, the workload that has to be done between n~i and £ Ufi pt is lower-bounded 
by XB. Since rj > £ u ,opt, the workload that has to be done between n-% and r{ is also lower- 
bounded by XB, and hence, 

£ b>(T (0, ri-i,n) >^-XB = XE W , (6) 

Hence, by Eq. ^ and Eq. ([6]), we have (1 — X)E W < (3 — A) • £& j<T (0, ri_i,ri). By rephrasing 
Eq. ([2]) in Lemma [10| for both subcases, we have 

Sv^iS, £i,n) < fa J2 c i + ( x " A ) • E ™ + 2 • E ™ 
- ^ CT ' E c i + ( 3 ~~ A ) ■ £ b,cr{.0,ri-i,ri) + 2-£ w (0,£u,opt,r u ,opt)- (7) 

Case 3: ij intersects exactly with one sleep interval U u = (£ u ,opt, r u,opt) and £{ is contained in 
interval U u , then, by Lemma [8j we have Vi = 1 and therefore 

£"M>,aA'- , > tj, TjJ v. z • .C/^ 5: -^ " C"W\S~' i ^u,opti ^u,opt)- \PJ 

Provided the individual upper bounds for each awaken interval in Eq. Q, Eq. (I7j), and Eq. ([8]), 
we combine them to get an overall upper bound for £ w>a -(S, 0, 00), which is the summation over 
each awaken interval, Yli<i< K ^w,a{S,£i,ri). 

First, since the awaken intervals are mutually disjoint, the item £b iff (0,ri—i,ri) is counted 
at most once for each i, 1 < i < k. Note that, ^i <i<K £b,a(0, r i-i, r i) < £b,criP,Q, 00). Second, 
by Lemma [9j a sleep interval U u intersects with at most two awaken-intervals of S. Hence, 
£ W (0,£ M , r u ) is counted at most twice. Therefore, when 2 — \/3 < X < 1, we have 

£ w ,cr(S,Q,oo)= ^2 £ w ,a{S,£i,ri) 

Ki<K 



. n 



^ E ^ • E c j + ( 3 - A ) • E f ^(°> r *-i' 

l<i<K V jeJi J l<i<K 

t 4 > &w\^ 1 £u,opti1"u,opt) 

l<u<m 

<(4 - A) • £ 6)<r (0, 0, 00) + 4 • £ W (C>, 0, 00), 
where the last inequality comes from the assumption that ip a < V&- D D 

Theorem 12. By setting A = 1, algorithm S computes a 4-competitive schedule for any given 
set of jobs satisfying Condition (fij) for the online power-minimizing scheduling problem. 

Proof. The feasibility of S is guaranteed by Lemma rol For the competitive factor, let £{S) 
and £ (C) denote the total energy consumption for schedule S and O, respectively. First, we 
have £b(S,0, 00) < £& jCT (O,0,oo), since the workload to execute is the same in both 5 and O. 



Together with Lemma 11, we have £{S) = £b{S, 0, oo) + £ Wjff (S, 0, 00) < (5 — A) -8^(0, 0, 00) + 
4 • £ w (0, 0, 00) < (5 — A) • £(0), provided that 2 — a/3 < A < 1. By choosing A to be 1, we have 
£(S) < 4 • £{0) as claimed. □ □ 



4.3 Jobs with Unit Execution Time 

We show that, when the execution times of the jobs are unit, we can benefit even more from the 
energy-efficient anchor with a properly chosen parameter A = 4 — \/l0. The major difference is 
that, when the system is in urgency while Q^ ort / v (i) is empty, i.e, processor M2 is in standby, 
we use a global earliest-deadline-first scheduling by executing two jobs on Mi and M<i instead 
of keeping one processor in standby, which improves resource utilization. Let S' denote the 
modified algorithm. 

As a result, both processors will keep running until the global ready queue is empty. Hence, 
we know that, when V% = 2 for an awaken-interval Ii of S\ the total standby time in Ij is at most 



(l — 2)^- The following two lemmas are the updated versions of Lemma 10 and Lemma 11 
respectively. 

Lemma 13. When the jobs have unit execution times, for each awaken-interval Jj, 1 < i < k, 
we have £ w , a {S\ti,n) < (3- £)E W . 



Lemma 14. When 7 — v41 < A < 4 — v 10, we have 

£^(^,0,00) <U-^Y £ M (0,O,oo) + U- ^ ) • f ir (0. 0. x ). 



By Lemma [l4] with A chosen to be 4 - JlO, we get £(S*) < (4- f) • £{0) < 3.59 • £{0). 
We have the following theorem. 

Theorem 15. By setting A = 4 — VlO, algorithm S^ computes a 3.59-competitive schedule 



for any given set of jobs with execution time that satisfies Condition (flj) for the online power- 
minimizing scheduling problem. 

4.4 Trading the Number of Processors with the Energy-Efficiency 

We have shown how a stream of jobs satisfying Condition (fij) can be scheduled online to obtain 
a 4-competitive schedule which uses at most two processors. By collecting the delayed jobs 
and bundling their execution, we can generalize the algorithm to a prescribed collection of 
job streams, each of which satisfies Condition (fTl), to allow a trade-off between the number of 
processors we use and the energy-efficiency of the resulting schedule. 

Let Ji, J2, ■ ■ ■ , Jk De the given collection of job streams, where Ji satisfies Condition (flj) 

for all 1 < % < k. Below we show that, for any fixed h, h > k, how a (4 • max 



h-k 

competitive schedule which uses at most h processors can be obtained. First, if h > 2k, then 
we simply apply the algorithm S on every pair of the streams, i.e., on Ji% and Ja+i, f° r 
all 1 < i < |, and we get a 4-competitive schedule. For the case k < h < 2k, we divide 
J11J2, ■ ■ ■ , Jk m to h — k subsets such that each subset gets at most \k/(h — k)~\ streams. The 
h processors are allocated in the following way. Each stream of jobs gets one processor, and 
the remaining h — k processors are equally allocated to each of the h — k subsets. Then, the 
problem is reduced to the remaining case, h = k + 1. 

Below we describe how the case h = k + 1 is handled. Let Mo, Mi, . . . , M& denote the 
k + 1 processors to be managed, and let Qo, Qi, • • • , Qk be the corresponding subset of jobs 
that are scheduled to be executed on each processor. We use the parameter A = 1 to set the 
energy-efficient anchor for each job that arrives. Let Wo(t, tf) = YljeQo rf<*t c j(*) denote the 
total remaining execution time of the jobs in Qo whose deadlines are less than or equal to t\ 
for any time t and any V with V > t. The algorithm works as follows. When a job j G Ji, 
1 < i < k, arrives, if Mj is on, then we add j to Qj. Otherwise, we further check whether 
Wo(t,tf) + Cj < P — t holds for all v > dj. If it does, then j is added to Qo- Otherwise, we 
add j to Qi and turn on the processors Mj and Mq (if Md is off). At any time t, we have the 
following cases to consider. 



• First, regarding the further conditions to turn on Mq, if Mq is off, and if the energy-efficient 
anchor of some job in Qo is met or there exists some V > t such that Wo(t,t') > P — t, 
then we turn on Mq. 

• Second, regarding the scheduling, for each < i < k such that Mi is on, we use the 
earliest-deadline- first principle to schedule the jobs of Qj on Mj. 

• Finally, regarding the conditions for turning off the processor, when Qo becomes empty 
and Mq has been turned on for at least B amount of time, we turn off Mq. For 1 < i < k, 
if Mi is on, Qj becomes empty, and Mq is off, then we turn off M,. 

Theorem 16. Given a collection of A; job sets, each satisfying Condition (fTl), we can compute 
a f 4 • max < -^—^ , 1 > J -competitive schedule that uses at most h processors, where h > k, for 
the online power-minimizing scheduling problem. 

5 Conclusion 

We conclude with a brief overview on future research directions. With the advances of technol- 
ogy, racing-to-idle has become an important scheduling model for energy efficiency. For online 
settings, it still remains open how a balance between (1) the extent to which the executions 
of the jobs should be postponed, and (2) the extent to which the system should speed up for 
impulse arrivals, can be reached or even be traded with other parameters. We believe that this 
will be a very interesting research direction to explore. 
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A Problem Lower Bound 

Lemma 2l If the first job released by A is scheduled at time t with < t < ^ — x^jB, then 
we have 

>2 + -x-0[-) foranyO<x<-. 



£{0) ~ 2 V^ 

Proof of Lemma^ Let m be the number of jobs A releases, including the first job at time 0. 
Suppose that these m jobs are indexed in an ascending order according to their arrival time. 
By definition a± = and a m < |S. 

Hence, the online algorithm II turns on a processor at least m times, in which the energy 
consumption for switching on is at least m ■ E w . Moreover, except for (m — l)eo units of time 
when no processor is on, II keeps a processor on for (W — t) — (m — l)eo time units in the time 
interval [t, ^]. Therefore £(H) is at least mE w + (W — t)ip — (m — l)eoifj, which is at least 
mE w + (1 + x)Bip — (m — l)ip. 

Then, we give the upper bound for the energy consumption of an optimal offline schedule by 
constructing a specific feasible offline schedule for these m jobs. Note that, the job A releases 
at time is flexible and can be scheduled at any time between t and B — 1. Specifically, we 
consider 4 cases, depending on the value of m. 

When m = 1, we have only one job to execute and one feasible schedule is to turn on the 
processor at any time t < B — 1 followed by immediately turning off. Therefore, for m = 1, 
£{0) <E w + ip, and 

m > w ±xl _ (i 

When in = 2, there are two subcases. (1) If 02 < £>, then the two jobs can be scheduled to 
run consecutively and the energy consumption is E w + (1 + eo)^- (2) If 02 > B, the first job can 
be scheduled to run at time B — l, and the processor idles from B to 02 and executes the second 
job at time 02- The energy consumption is E w + (02 — B + (1 + eo))ip, which is maximized when 
02 = \B. Therefore, when m = 2, for both subcases we have £{0) < E w + (^ + 2)^, and 

£(n) >*(3 + ,)-l = » _ [i 



£(£>) - f + 2 3 \k 

When m = 3, there are two subcases as well. (1) If 03 — a<i > B, then 02 < B, and the first 
two jobs can be scheduled to run consecutively. The processor is off from 02 + eo to 03. The 
energy consumption for this subcase is 2E W + (1 + 260)^- (2) If 03 — «2 < B, we schedule the 
first job at any time between ^B and B — l. Disregarding 02 < B or not, the energy consumption 
for this subcase is at most E w + (£> + eo)^- Therefore, when m = 3, we have £{0) < 2E W + 3i/>, 

and 

£(U)^k(i + x)-2 =2+ l_ x _ fl 



£{0) 2k + 3 2 \k 

When m > 4, we keep the processor on from time t to time W + cq. The energy consumption 
for the optimal offline schedule is at most E w +(^ — t + l\ip, and 

£(U) 4fc+(f -t) -3 _ llk-2t-6 

£jO) ~ k+f-t + 1 ~ 5k-2t + 2' 

Since £(H) > £(0) > and t > 0, the above fraction achieves its minimum at t = 0. Hence, 
when m = 4, we have 

£(U) Ilk -6 _ 11 (1 



£{0) ~ 5k + 2 5 \k 
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Combining the above four cases, we get 

W)- min { 2+x - 2+ h 2+1 2 x -j}- {i)^ 2+ r- {b- 

when k is large enough and < x < g. D D 

Lemma 3 If the first job released by the adversary is not executed until time (| — x)B, then 
for any x, 77, x > we have 

£(II) . J3 + x + T]4: + x + r] + x5 + x + r] + x\ „l '1 
——— > mm < — 5 , , > — (J — 

£ (O) ~ \ l+x 2 + x + n , 2 + x + ri + X j \k 

Proof of Lemma^ By our design, A releases exactly (| + %)B jobs before time B, forcing II 
to use at least two processors to have a feasible schedule. Let m\ be the number of the urgent 
unit jobs that A releases between time B and time (| +r])B. If mi = 0, then II keeps at 
least one processor in the standby mode till time (| + n)B, and the energy consumption is 
£ (II) > 2 • E w + (h + x)B ■ ip + (J + ri)B ■ i{j ■ A feasible offline schedule can execute all these 
jobs consecutively on a processor from time (^ — x)B to B, with energy consumption equal to 
E w + (5 + x )& ' V'- Therefore, for mi = 0, we know that £{0) < E w + (\+ x)B ■ if), which also 
implies 

g(n) . 3 + x + V 

£{0)~ l + x ■ W 

For the other case with m\ > 1, the adversary A monitors the behavior of LI till time 
(| + r] + x)-B. Let rri2 be the number of the urgent unit jobs A releases after time (| + rj)B. 
Then by an analogous argument, we have 5(11) > (2+7711+7712) --EVo + Q + x)i3-V , + (^ + i] + x)&' 
ip—(mi + rfi'i)^. Note that this function is minimized when mi and 777.2 achieve their minimums. 

A feasible offline schedule on a processor for mi > 1 is constructed by turning on the 
processor at time (j — x)B and executing the coming jobs. If 7712 = 0, then we turn off the 
processor at time (| + ??)£>, and, hence, £{0) < E w + (1 + x + r])Bip. Otherwise, we turn the 
processor off at time (§+?? + x)#, and, hence, £{0) < E w + (1 + x + r\ + x)^- Therefore, 



cvm ( (3+m 1 +x+ri+x)k-m 1 

(3+mi+m2+x+q+x)fc-(mi+m2) 
(2+x+»;+x)-fc 



£(0) ~ min 



>min( 4 + X + ?? + X , 5 + :E + ?? + X l-of^ (10) 

1 2 + X + 7/ '2 + X + 7/ + XJ V fc / 

By Eqs. ([9]) and (10), the lemma is proved. □ □ 



Theorem |4[ The competitive factor of any online algorithm U for the online power-minimizing 
scheduling problem is at least 2.06. 

Proof of Theorem [^} By Lemmas [2] and [3j we know that 

£(II) . l4 + x 3 + x + t/ 4 + X + T/ + X 5 + x + 7/ + xl ^/l 



> sup min < , — 5 , , > — O . , 

^(O) _ 0<«<2/5 1 2 § + Z 2 + X + 7, '2 + X + 7/ + XJ V^ 

0<r;,x 

To get an asymptotic sup-min of the four items inside the brace, observe that, as x increases, 
the third item increases while the fourth item decreases. Therefore, the value of x f° r which 
the last two items achieve their max-min can be solved. In this way, we can solve rj and x to 
get the asymptotic lower bound for £'U)/£(0). 

By choosing x, rj, and x to be 0.1218, 0.2206, and 0.4852, respectively, we get £(H) > 
2.06 • £(0) when k > 1. □ □ 
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B Online Scheduling 

B.l Common Approaches and Bad Examples 

Lemma [5]. The competitive factor of C can be arbitrarily bad. Furthermore, even when Cj = 1 
for all j E J , the competitive factor of £ is still at least 6. 

Proof of Lemma^ For the first part of this lemma, consider the following job set, J e df- Let k 
be a non-negative integer. At time 0, a job j\ = (0, 4k, 3k) arrives and is delayed. The system 
remains in standby until time k, at which k — 1 unit jobs arrive to the system with deadline 
4k — 1. As a result, j\ will miss its deadline unless we turn on k processors. This shows that, the 
number of processors C will use for J e( w can be arbitrarily large, whereas the optimal schedule 
uses only one processor, and the competitive factor can therefore be arbitrarily large. 

For the second part of this lemma, we define another job set J. Let n > be an even integer. 
The job set J consists of \n jobs: ji,j 2 , ■ ■ -,jn, j'1,3'2, ■ ■ ■ ,f n / 2 , where 

fj t = ((» - 1)£ + 2t, %B + 2» + 1), for each 1 < i < n, and 
\j' k = (2kB + 4k, 2kB + 4k + 1), for each 1 < k < §. 

Let £(C) and £(0) be the energy consumed by C and the offline optimal schedule for the 
input instance J, respectively. We prove this theorem by showing that £(C) is asymptotically 
no less than 6 • S(0). 

First we consider the behavior of the algorithm C. At the beginning, C postpones the 
execution of j\ until time B + 2, for which it turns on a processor to execute ji, followed by 
staying in standby for B units of time until it turns off the processor at time 2B + 3. At time 
2B + 4 when L will schedule j% to execute, an urgent job j[ arrives. As a result, C has to turn on 
another processor for it. Then C keeps both processors in standby followed by turning them off 
at time 3Z3 + 5. The same pattern is repeated for the remaining ^p — 3 jobs. See also Fig 2 for an 
illustration. For each repetition of the pattern, say, for jobs J2k-i,J2k, and j' k , 1 < k < S, the 
energy consumption of C is at least 3 • E w + 2>B ■ Tp a = QE W . Therefore £(C) > \ ■ 6E W = 3nE w . 



Figure 2: Dotted horizontal lines represent the periods for which the processor is in the standby 
mode, and double vertical lines represent the turn-ons and turn-offs. 

A feasible schedule for J, denoted by J 7 , can be constructed as follows. For each 1 < k < §, 
we turn on a processor at time max{0, (2k — 2)B + 4k — 5} and turn off the processor at time 
(2k — 2)B+4k — l. We schedule J2k-l a t the time it arrives, which is at (2k — 2)B+4k — 2. Then, 
J2k and j' k are scheduled at 2kB + 4k — 1 and 2kB + 4k, respectively. The energy consumption of 
this schedule, which is also an upper-bound of £ (O), is at most (§ + l)(E w + 4?/^). Therefore, 

£(C) 3nE w / 1 \_ (1 

£(0)~ (l + l)(E w + 4^ b ) \E w + ^ b J \n 

which converges to 6 when E w ^> ip b > 1 and n>l. □ □ 

B.2 The Analysis 

Lemma pi Provided that Condition pi) from Lemmaul holds for the input jobs, algorithm S 
always produces a feasible schedule. 
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Proof of Lemma^ Consider any awaken-interval Ii, where 1 < % < k. If the urgency flag 
is never set true within F, then we know that only M\ is turned on during Jj, and we have 
W(£i,t) < t — £i for all t > £i at time £i when processor M\ is turned on. Moreover, by the 
design of S, once Mi is on, it keeps executing jobs using the EDF principle until the queue Q(t) 
becomes empty. Therefore by Lemma [TJ the schedule S produces during Ii is feasible. 

On the other hand, if the urgency flag is set true at some moment t* between ii and r,-, 
then by a similar argument, we know that the schedules S produces for the subsets Qp roc (t) 
and Q t f ort h('t) are feasible on Mi and M%, respectively, for all £i < t < ri. □ □ 

Lemma [7[ For each i, 1 < i < k, we have 

• Vi<2. 

• If "Pi = 2, then the amount of workload that arrives after rj_i and has to be done before 
ri is at least XB. 

Proof of Lemma^7\ If the urgency flag is never set true in Ii, then V% = 1. Otherwise, suppose 
the system enters and leaves urgency at time t* and v ', respectively. By our scheme, Q Proc (tf) 
is empty, and W(V ,t) < t — V for all t > V by our assumption on Condition (lip . Therefore S 
will not turn on M\ again before Ii ends. This shows V% < 2. 

Suppose that S turns on M2 at time t*, and V is the corresponding moment for which 
W(t*,v) > v — t*. Then by the design of S, we know that the ready queue is never empty 
between time £{ and t*, for otherwise it will contradict Condition (nj) since W(t*,v) > V — t* . 
Therefore the system stays in the running mode and never enters the standby mode between 
time £i and t* . 

Since we use EDF principle, the workload that has to be done before v is hence strictly 
greater than v —£%. Namely, among these jobs, at least one, say, j, has arrival time earlier than 
£{. That is, rj_i < ctj < £%. Therefore dj > £{ + XB; otherwise S would have turned on Mi 
earlier than £i when the energy-efficient anchor of j is met. Therefore the total workload that 
is done within Ij is at least v — £\ > dj — £i > XB. D D 

Lemma [8l For any i, 1 < % < k, if both n—i and £i are contained in some sleep interval 
U u = (£ u ,opt, r u ,o P t) of O, then T>i = \ and n - r U)0pt > (1 - X)B. 

Proof of Lemma^ Since Ij_i ends when the ready queue is empty, the arrival time of each job 
in Jj is later than rj_i. Therefore V% = 1 as O is a feasible uni-processor schedule which turns 
on the system to process jobs no earlier than £^. 

Let j be the job scheduled to execute at time £i. Since we use EDF principle, dji > dj > r Ufipt 
for all j' G Q(£i)- There are two cases to set t- t in S: (1) the jobs in Q(4) will miss the deadline 
if the system is not turned on for processing at time £i, and (2) the energy-efficient anchor of 
a certain job is £{. For the first case, the sleep interval U u cannot contain £j. Hence, when £i 
is contained in U u , we have dj — £i < XB. Therefore, ri — r Ufipt > r^ — dj > rj — (XB + £i) > 
(l-A)B. ' D D 

Lemma [9j Each sleep interval of O intersects with at most two awaken-intervals of S. In 
particular, each of U\ and U m intersects with at most one. 

Proof of Lemma^ This lemma follows directly from Lemma |8j For 1 < i < k — 2, if r» and 
4+i are contained in a sleep interval U u = {£ u ,opt-,r UfiP t) of O, then n+i —r Ufipt > (1 — A)^ > 0, 
which implies that £j+2 will not be contained in U u . □ D 
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Lemma 10, For each awaken-interval Ii, 1 < i < k and A < 1, 



£ w ,cr(S, ii, n) <i ^Cj +(3- X)E U 
Furthermore, ifVi = 1, then £ w , a {S,£i,ri) < 2E U 



Proof of Lemma 10, If Vi = 1, then the total idle time in F is at most B, since in our design, 
S will turn off the processor immediately once Q(t) is empty when t > l{ + B. Therefore 

£w,a{S, ii,ri) < E w + tp a ■ B = 2E W , which is at most t/> ct • ( J2jeJ Cj ) + (3 — \)B when A < 1. 

If Vi = 2, only processor Mi could be in the standby mode according to our design. For 
the total standby time in Ij, we have two cases to consider: (1) If the system exits urgency 
after Hi + B, then the total standby time on processor M2 is upper-bounded by the workload 
on processor M\, which is at most X^'ej c j- (2) If the system exits urgency before £{ + B, the 
total standby time is at most B, which is at most Y^jeJ- c o + (1 — ^)^ since X)jeJ- c j — ^ 
by Lemma Fn In both cases, the total standby time is at most X^ej c j + (1 ~~ A)i3. Therefore 

£w,a(S,£i,ri) < 2E W + t/v • (X^igj c j + (1 — ^)^J- By Lemma 7 we only have to consider 
Vi = 1 and Vi = 2, and this concludes the proof. □ □ 



B.3 Jobs with Unit Execution Time 



Lemma |13[ When the jobs have unit execution times, for each awaken-interval Ii, 1 < i < k, 
we have £ w ^ a {S\li,ri) < (3 — ij)E w . 

Proof of Lemma [T3[ Since the total time in Ii is at most (l — j)S, we have 
£ w , a [S\ii,Ti) <2E W + (l - ^jB • ^V < (3-^J^. 



D 



D 



Lemma 
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Mien 7 - V41 < A < 4 - VlO, we have 



£ w , a (S\0,oo)< (s-^J -^(0,0,00) +U-^j -^(0,0, 00). 



Proof of Lemma 14 ■ We sketch only the main differences from the proof of Lemma 11 and adopt 
the remaining detail. For any awaken-interval Ii of S', 

• If Ii does not intersect with any sleep interval of S^ , then 

A 



£ w , ff (s\ei,n) < (s-^J 



■ F ir < |3- - ) '£ bt<r (0,£i,n). 



If F intersects with a sleep interval U u = (£ u ,opt, r u ,opt) and n is contained in U u , we have 
two subcases, (a) If Vi = 1, then 

£"ui,crv'- > i^it^i) — ^^w — ^^tul,*-^) £u,opti fu,opt)- 
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(b) If Vi = 2, then S bi(T (0 , n-i, £ Uj0pt ) > XE W by Lemma |7| When 7 - V41 < A < 1, we 
have 

{l-X)E w <xh-^jE w . 
Therefore, 



S W!(T (S\£i, n) <(l-±\ E w + 2E U 



< ( 3- - )S b ^(0,n-i,ri) + 2S w (0,£ Ut0p t,r u ,opt) 



If I{ intersects with exactly one sleep interval U u = (£u,opt,f u ,opt) and li is contained in 
U u , then by Lemma[8j V{ = \ and £ b>lT (0, r U)0pt , n) > (1 - A)£ w . When < A < 4 - y/l6, 



we have 



Therefore, 



^ w <(l-A)(3-^)i:„. 



S Wj(T (S\£i,n) < 2E W = ^E 2 +h- ^JE W 

_; I "J 7T I ^b,c7\S-^ i Tu,opti T"i) ~t~ \ ^ TtI '-■uil.LA *~u,optt Tu,opt) 



From the above three cases, when 7 — v41 < A < 4 — v 10, we have 

£*,„(£*, 0,oo)< (s-^J ■£ bjff (O,0,oo) + U-^\ -£ w (O,0,oo). 

a a 

B.4 Trading the Number of Processors with the Energy-Efficiency 

Theorem 16, Given a collection of k job sets, each satisfying Condition pp, we can compute 
a I 4 • max < j^ , 1 > J -competitive schedule which uses at most h processors, where h > k, for 
the online power-minimizing scheduling problem. 



Proof of Theorem 16, The feasibility of the modified algorithm follows directly from our as- 
sumption on Condition (fiT) and Lemma [11 

For the competitive factor of the algorithm, it suffices to consider the case k < h < 
2k. Consider the partition of the collection of job streams. Without loss of generality, let 
JiiJii ■ ■ ■ 1 J\k/{h-k)'] be one subset, and let Mq, M\, . . . , Mrfc/Oj-fc)] be the processors allocated 
to these job streams. Consider any specific stream Ji, 1 < i < \k/(h — k)~\, and the two pro- 
cessors Mq, and Mj. Observe that the lemmas presented in § |4.2| still holds for Mq and Mj. 
Therefore the energy consumed by Mq and Mj is bounded by four times the energy consump- 
tion of any offline optimal schedule for J\, J2, ■ ■ ■ , J\k/(h-k)'] ■ Since we have exactly \k/{h — k)~\ 
such pairs, each unit of energy consumption in the offline optimal schedule, including switch-on 
operations, energy for executing jobs, and energy for standby, is charged at most \k/(h — k) 
times. Therefore, by taking all the pairs Mq and Mi into account, we get the factor 4 • j^. 
as claimed. □ □ 
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